Integrated Linguistic Resources for Language Exploitation Technologies

نویسندگان

  • Stephanie Strassel
  • Christopher Cieri
  • Andrew Cole
  • Denise DiPersio
  • Mark Liberman
  • Xiaoyi Ma
  • Mohamed Maamouri
  • Kazuaki Maeda
چکیده

Linguistic Data Consortium has recently embarked on an effort to create integrated linguistic resources and related infrastructure for language exploitation technologies within the DARPA GALE (Global Autonomous Language Exploitation) Program. GALE targets an end-to-end system consisting of three major engines: Transcription, Translation and Distillation. Multilingual speech or text from a variety of genres is taken as input and English text is given as output, with information of interest presented in an integrated and consolidated fashion to the end user. GALE's goals requires a quantum leap in the performance of human language technology, while also demanding solutions that are more intelligent, more robust, more adaptable, more efficient and more integrated. LDC has responded to this challenge with a comprehensive approach to linguistic resource development designed to support GALE's research and evaluation needs and to provide lasting resources for the larger Human Language Technology community.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Multipurpose Design of Greek Sign Language Resources: a Factor towards Universal Access

In this paper we present the methodology of data collection and implementation of databases for the creation of extensive lexical and terminological resources for the Greek Sign Language (GSL) in order to introduce the major issue of dynamic sign representation. In respect to electronic linguistic resources of GSL, the focus is on issues of validation of linguistic content, multipurpose design ...

متن کامل

Integrated Environment for Management and Exploitation of Linguistic Resources

act — In this paper we describe two tools that form an ed environment which can be successfully used for ment and exploitation of linguistic resources. Both the d the resources were developed within the University of e Human Language Technology Group. The tools we are WS4LR, a software tool that has been developed d for solving different tasks within the Group, and a lication named WS4QE, accom...

متن کامل

Selection of Foreign Language Teaching Content in Russian Master of Laws (LLM) Graduate Programs

Master`s degree was integrated into the system of Russian Higher Education several decades ago, however, teaching foreign languages at this level still needs further analysis including the postgraduate law students training. The article investigates the principal components of foreign language teaching in Master of laws Graduate Programs (considering the case of the English language) on the bas...

متن کامل

Integrated Language Technologies for Multilingual Information Services in the MEMPHIS Project

The MEMPHIS project integrates a large set of NLP technologies. An overview of components, their underlying technologies and resources will be presented: language identification, document classification, linguistic analysis, summarization, information extraction, machine translation, knowledge management and crosslingual retrieval.

متن کامل

Sign Language & Linguistics

The work reported in this study is based on research that has been carried out while developing a sign synthesis system for Greek Sign Language (GSL): theoretical linguistic analysis as well as lexicon and grammar resources derived from this analysis. We focus on the organisation of linguistic knowledge that initiates the multi-functional processing required to achieve sign generation performed...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006